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Abstract 

We review theory of information thermodynamics which incorporates effects 
of measurement and feedback into nonequilibrium thermodynamics of a small 
system, and discuss how the second law of thermodynamics should be extended 
for such situations. We address the issue of the maximum work that can be 
extracted from the system in the presence of a feedback controller (Maxwell's 
demon) and provide a few illustrative examples. We also review a recent exper- 
iment that realized a Maxwell's demon based on a feedback-controlled ratchet. 
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1 Introduction 

The profound interrelationship between information and thermodynamics was first 
brought to light by J. C. Maxwell [T] in his gedankenexperiment about a hypothetical 
being of intelligence which was later christened by William Thomson as Maxwell's 
demon. Since then, numerous discussions have been spurred as to whether and how 
Maxwell's demon is compatible with the second law of thermodynamics 

To understand the roles of Maxwell's demon, let us consider a situation in which 
a gas is confined in a box surrounded by adiabatic walls. A barrier is inserted at the 
center of the box to divide it into two. The temperatures of the gases in the two boxes 
are assumed to be initially the same. We assume that the demon is present near the 
barrier and can close or open a small hole in the barrier. If a faster-than-average 
(slower-than-average) molecule comes from the right (left) box, the demon opens the 
hole. Otherwise, the demon keeps it closed. By doing so over and over again, the 
temperature of the left gas becomes higher than that of the right one, in apparent 
contradiction with the second law of thermodynamics. This example illustrates the 
two essential roles of the demon: 

• The demon observes individual molecules, and obtains the information about 
their velocities. 

• The demon opens or closes the hole based on each measurement outcome, which 
is the feedback control. 

In general, feedback control implies that a control protocol depends on the measure- 
ment outcome, or equivalently, we control a system based on the obtained informa- 
tion ISIE]. The crucial point here is that the measurements are performed at the 
level of thermal fiuctuations (i.e., the demon can distinguish the velocities of indi- 
vidual molecules). Therefore, Maxwell's demon can be characterized as a feedback 
controller that utilizes information about a thermodynamic system at the level of 
thermal fiuctuations (see also Fig. 1). 

In the nineteenth century, it was impossible to observe and control individual 
atoms and molecules, and therefore it was not necessary to take into account the 
effect of feedback control in the formulation of thermodynamics. However, due to the 
recent advances in manipulating microscopic systems, the effect of feedback control 
on thermodynamic systems has become relevant to real experiments. We can simulate 
the role of Maxwell's demon in real experiments and can reduce the entropy of small 
thermodynamic systems. 
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Figure 1: Maxwell's demon as a feedback controller. The demon performs feedback 
control based on the information obtained from measurement at the level of thermal 
fluctuations. 

In this chapter, we review a general theory of thermodynamics that involves mea- 
surements and feedback control [71-Hl]. We generalize the second law of thermody- 
namics by including information contents concerning the thermodynamics of feedback 
control. We note that, by the "demon," we mean a type of devices that perform feed- 
back control at the level of thermal fluctuations. 

This chapter is organized as follows. In Sec. 2, we discuss the Szilard engine which 
is a prototypical model of Maxwell's demon and examine the consistency between 
the demon and the second law. In Sec. 3, we review information contents that are 
used in the following sections. In Sec. 4, we discuss a generalized second law of 
thermodynamics with feedback control, which is the main part of this chapter. In 
Sec. 5, we generalize nonequilibrium equalities such as the fluctuation theorem and 
the Jarzynski equality to the case with feedback control. In Sec. 6, we discuss the 
energy cost (work) that is needed for measurement and information erasure. In Sec. 7, 
we conclude this chapter. 

2 Szilard Engine 

In 1929, L. Szilard proposed a simple model of Maxwell's demon that illustrates 
the quantitative relationship between information and thermodynamics In this 
section, we briefly review the model, which is called the Szilard engine, and discuss 
its physical implications. 

The Szilard engine consists of a single-particle gas that is in contact with a single 
heat bath at temperature T. By a measurement, we obtain one bit of information 
about the position of the particle and use that information to extract work from 
the engine via feedback control. While the engine eventually returns to the initial 
equilibrium, the total amount of the extracted work is positive. The details of the 
control protocol dbTQ clS follows (see Fig. 2). 

Step 1: Initial state. We prepare a single-particle gas in a box of volume Vq, which 
is at thermal equilibrium with temperature T. 
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step 1 : Thermal equilibrium. Step 2: Insertion of the barrier. 




Figure 2: The Szilard engine. See the text for details. 



Step 2: Insertion of the barrier. We insert a barrier in the middle of the box, and 
divide it into two with equal volume Vo/2. 

Step 3: Measurement. We perform an error-free measurement of the position of 
the particle to find which box the molecule is in. Because the particle will be found 
to be in each box with probability 1/2, the amount of information gained from this 
measurement is one bit. We note that one (= logg 2) bit of information in the binary 
logarithm corresponds to In 2 nat of information in the natural logarithm. 

Step 4- Feedback. If the particle is in the left (right) box, we quasi-statically 
expand it by moving the barrier to the rightmost (leftmost) position. By this process, 
we can extract work W^xt given by 

W^ext= / -^dV = k^T\n2, (1) 

JVo/2 V 

where we used pV = k-gT with p, V, and /cb respectively being the pressure, the 
volume and the Boltzmann constant. This process corresponds to feedback control, 
because the direction of the expansion (i.e., left or right) depends on the measurement 
outcome. After this expansion, the gas returns to the initial thermal equilibrium with 
volume Vq. 

The extracted work is proportional to the obtained information with proportion- 
ality constant k^T. This is due to the fact that the entropy of the system is effectively 
decreased by In 2 via feedback control, and the decrease in entropy leads to the in- 
crease in the free energy by ksT \n2, which is the resource of the extracted work. 

The Szilard engine prima facie seems to contradict the second law of thermody- 
namics, which dictates that one cannot extract positive work from a single heat bath 
with a thermodynamic cycle (Kelvin's principle). In fact, the Szilard engine is consis- 
tent with the second law, due to an additional energy cost (work) that is needed for 
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the measurement and information erasure of the measurement device or the demon. 
This additional cost compensates for the excess work extracted from the engine. In 
his original paper, Szilard argued that there must be an entropic cost for the mea- 
surement process [15]. We stress that the work performed on the demon need not be 
transferred to the engine; only the information obtained by the measurement should 
be utilized for the feedback. This is the crucial characteristic of the information heat 
engine. 

By utilizing the obtained information in feedback control, we can extract work 
from the engine without decreasing its free energy, or we can increase the engine's 
free energy without injecting any work to the engine directly. The resource of the work 
or the free energy is thermal fluctuations of the heat bath; by utilizing information 
via feedback, we can rectify the thermal energy of the bath and convert it into the 
work or the free energy. This method allows us to control the energy balance of 
the engine beyond the conventional thermodynamics. We shall call such a feedback- 
controlled heat engine as an "information heat engine." The Szilard engine works 
as the simplest model that illustrates the quintessence of information heat engines. 
Quantum versions of the Szilard engine have also been studied [T21EI1ES1I13]- 

3 Information Content in Thermodynamics 

In this section, we briefly review the Shannon information and the mutual informa- 
tion [ISmZ]. In particular, the mutual information plays a crucial role in thermody- 
namics of information processing. 

3.1 Shannon Information 

Let X G X be a probability variable which represents a finite set of possible events. We 
write as P[x\ the probability of event x being realized. The information content that 
is associated with event x is then defined as — lnP[a;], which implies that the rarer 
an event is, the more information is associated with it. The Shannon information is 
then given by the average of — lnP[x] over all possible events: 



The Shannon information satisfies < H{X) < InN, where N is the number of the 
possible events (the size of set X). Here, the lower bound {H{X) = 0) is achieved if 
P[x] = 1 holds for some x; in this case, the event is indeed deterministic, while the 
upper bound {H{X) = InN) is achieved if P[x] = 1/N for arbitrary x. In general, the 
Shannon information characterizes the randomness of a probability variable; the more 
random the variable, the greater the Shannon information. Consider, for example, 
a simple case in which x takes two values: x = or x = 1. We set P[0] =: p 
and P[l] =: 1 — p with < p < 1. The Shannon information is then given by 
H{X) = —plnp — {1 — p) ln(l — p), which takes the maximum value In 2 for p = 1/2 
and the minimal value for p = or 1. 




(2) 



X 
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3.2 Mutual Information 

The mutual information characterizes the correlation between two probability vari- 
ables. Let X E X and y E Y he the two probability variables, and P[x,y] be their 
joint distribution. The marginal distributions are given by P[x] = P[x, y], P[y] = 
J2x y]- If two variables are statistically independent, then P[x,y] = P[x]P[y]. 
Otherwise, they are correlated. If the two variables are perfectly correlated, the joint 
distribution satisfies 



P[x, y] = 6ix, fiy))P[x] = fiy))P[yl (3) 

where 6{-, ■) is the Kronecker's delta and /(■) is a bijection function on Y. For 
example, if P[0, 1] = P[1,0] = 1/2 and P[0,0] = P[l, 1] = with X = {0, 1} and 
Y = {0, 1}, the two variables are perfectly correlated with /(O) = 1 and /(I) = 0. 



If /(■) is the identity function that satisfies f{y) = y for any y, Eq. (|3]) reduces to 
P[x,y] = 5{x,y)P[x] = 6{x,y)P[y]. 

The conditional probability of x for given y is given by P[x||/] = P[x,y]/P[y]. If 
the two probability variables are statistically independent, the conditional probability 
reduces to P[x|?/] = P[x]. This implies that we cannot obtain any information about 
X from knowledge of y. On the other hand, in the case of the perfect correlation (|3]), 
we obtain P[x|?/] = 6{x,f{y)). This means that we can precisely estimate x from y 
hy x = f{y). 

We next introduce the joint Shannon information and the conditional Shannon 
information. The Shannon information for the joint probability P[x,y] is given by 

HiX, y) := - 5^ P[x, In P[x, y] . (4) 

xy 

On the other hand, the Shannon information for the conditional probability P[a;|?/], 
where x is the relevant probability variable, is given by 

H{X\y) := -J2P{^W^P[^\y]- (5) 

X 

By averaging H{X\y) over y, we define the conditional Shannon information 

HiX\Y) := Y,Ply]HiX\y) = - P[x, lnP[a;|y]. (6) 

y xy 

The conditional Shannon information satisfies the following properties: 

H{X\Y) = H{X, Y) - H{Y), H{Y\X) = H{X, Y) - H{X). (7) 
By definition, H{X\y) > and H{X\Y) > 0. Hence 

H{X, Y) > H{Y), H{X, Y) > H{X), (8) 
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which imphes that the randomness decreases if only one of the two variables is con- 
cerned. 

The mutual information is defined by 

/(X : Y) ■= H{X) + H(Y) - H{X, Y), (9) 

or equivalently, 

/(X:y, = 5:Pl.,,ll„^. (10) 

As shown below, the mutual information satisfies 

< /(X : r) < H{X), < /(X : r) < H{Y). (11) 

Here, /(X : y) = is achieved if X and Y are statistically independent, i.e., y\ = 
P[x]P[y]. On the other hand, J(X : Y) = H{X) is achieved if H{X\Y) = 0, or 
equivalently, if H{X\y) = for any y. This condition is equivalent to the condition 
that, for any y, there exists a single x such that -P[x|?/] = 1, which implies that 
we can estimate x from y with certainty. Similarly, /(X : Y) = H{Y) is achieved 
if H{Y\X) = 0. In particular, if the correlation between x and y is perfect such 
that Eq. ([3]) holds, /(X : Y) = H{X) = H(Y). In general, the mutual information 
describes the correlation between two probability variables; the more strongly x and 
y are correlated, the larger /(X : Y) is. 

The proof of inequalities f lTT]) goes as follows. Because ln(t^^) > 1 — t for t > 0, 
where the equality is achieved if and only if t = 1, we obtain 

, P\x\P\y] , P\x]P\y]\ , , 

which implies /(X :Y) >{). On the other hand, Eq. (|9]) leads to 

/(X : Y) = H{X) - H{X\Y) = H{Y) - H{Y\X), (13) 

which implies /(X : Y) < H{X) and /(X : Y) < H{Y). 

We note that Eqs. ([7]), ([2]), and (IT^ can be illustrated by using a Venn diagram 
shown in Fig. 3 [IH]. This diagram is useful to memorize the relationship among 
H{X\Y), H{Y\X), and /(X : Y). 

The mutual information can be used to characterize the effective information 
that can be obtained by measurements. Let us consider a situation in which a; is a 
phase-space point of a physical system and y is an outcome that is obtained from 
a measurement on the system. In the case of the Szilard engine, x specifies the 
location of the particle ("left" or "right"), and y is the measurement outcome. If 
the measurement is error-free as we assumed in Sec. 2, x = ?/ is always satisfied 
and the correlation between the two variables is perfect. In this case, /(X : Y) = 
H{X) = H{Y) holds, and therefore the obtained information can be characterized 
by the Shannon information as is the argument in Sec. 2. In general, there exist 
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Figure 3: A Venn diagram |l6] illustrating the relationship among different informa- 
tion contents. The entire region represents the joint Shannon information H{X,Y). 

measurement errors, and the obtained information by the measurement needs to be 
characterized by the mutual information. The less the amount of the measurement 
error is, the more the mutual information is. 

We next discuss the cases in which the probability variables take continuous values. 
The probability distributions such as P[x,y] and P[x] should then be interpreted as 
probability densities, where the corresponding probabilities are given by P[x,y]dxdy 
and P[x]dx with dxdy and dx being the integral elements. The Shannon information 
of X can be formally defined as 

//(X):^-/..PM.,>P14 (14) 

However, Eq. ( !T^ is not invariant under the transformation of the variable. In fact, 
if we change x to x' such that P[x]dx = P[x']dx', Eq. f[T^ is given by 

H{X) = - j dx'P[x']\nP[x']- j dx'P[x'] In^'^^. (15) 

Thus, that the Shannon information is not uniquely defined for the case of continuous 
variables. Only when we fix some probability variable, we can give the Shannon 
information a unique meaning. On the other hand, the mutual information is defined 

as 

nX :¥):= j dxdyP[x, y] In (16) 

which is invariant under the transformation of the variables. In this sense, the mutual 
information is uniquely defined for the cases of continuous variables, regardless of the 
choices of probability variables. 

3.3 Examples 

We now discuss two typical examples of probability variables: discrete and continuous 
variables. 
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Example 1 (Binary channel) We consider a binary channel with which at most 
one bit of information is sent from variable x to y [see Fig. 4 (a)]. Let x = 0, 1 be 
the sender's bit and y = 0, 1 be the receiver's bit. We regard this binary channel as a 
model of a measurement, in which x describes the state of the measured system and y 
describes the measurement outcome. We assume that the error in the communication 
(or the measurement error) is characterized by 

P[x = 0\y = 0] = l- So, P[x = 0\y = 1] = So, 

p[a;=l|y = 0]=£i, P[x = l|y=l]=l-£i, ^ ' 

where Eq and ei are the error rates for a; = and x = 1, respectively. The crucial 
assumption here is that the error property is only characterized by a pair {eo,ei), 
which is independent of the probability distribution of x. If Eq = ei =: e, this model 
is called a binary symmetric channel. 




Figure 4: (a) A binary channel with error rates Eq and ei. (b) Mutual information 
I{X : Y) versus error rate e for a binary symmetric channel with eo = ei =: e and 
p = 1/2, which gives I{X : Y) = ln2 + £:ln£: + {1 - e) ln(l - e). 

Let P[x = 0] =: p and P[x = 1] =: 1 — p he the probability distribution of x. 
The joint distribution of x and y is then given by P[x = 0,y = 0] = p{l — sq), 
P[x = 0,y = l]= pso, P[x = l,y = 0] = (1 - p)ei, P[x = l,y = l] = (1 -p)(l - £i), 
and the distribution of y is given by P[y = 0] = p{l — Eq) + (1 —p)ei =: q, P[y = 1] = 
peo + (1 — p)(l — Si) ='■ 1 — 9- By definition, we can show that the mutual information 
is given by 

/(X : Y) = H{Y) - pH{e,) - (1 - p)H{e,), (18) 

where H{Y) := —qlnq — (1 — g)ln(l — q) is the Shannon information for Y, and 
we defined H^ei) := — ejln^j — (1 — £«) ln(l — Ei) for z = and 1. From Eq. ffTSjl . 
I{X : Y) = H{Y) holds for (ec^i) = (0,0), (0,1), (1,0), or (1,1). For a binary 
symmetric channel, Eq. flTK]) reduces to I{X : Y) = H{Y) — H{e). Figure 4 (b) 
shows I{X : Y) versus e for the case of p = 1/2. The mutual information takes the 
maximum value if e = or £ = 1. In this case, we can precisely estimate x from y. 
We note that, if e = 1, we can just relabel "0" and "1" of y such that x = y holds. 

Example 2 (Gaussian channel) We next consider a Gaussian channel with con- 
tinuous variables x and y. Let x be the sender's variable or the "signal" and y be 
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the receiver's variable or the "outcome." We can also interpret that x describes the 
phase-space point of a physical system such as the position of a Brownian particle, 
and that y describes the outcome of the measurement on the system. We assume 
that the error is characterized by a Gaussian noise 

where N is the variance of the noise. For simplicity, we also assume that the proba- 
bihty density of x is also Gaussian: 

P[A-^^w{-^]. (20) 



where S is the variance of x. The joint probability density of x and y is then given 

by 

PI.,.] = Pl.|.]P[.l = -^axp - . (21) 

On the other hand, the probability density of y is given by 

which implies that the variance of the outcome y is enhanced by factor 1 + N/S 
compared with the original variance S of the signal x. We can also calculate the 
conditional probability density as 

^'l*l = ^ = ;7|=^exp(-^(.-^,) j, (23) 

which implies that the variance of the conditional distribution of x is suppressed 
compared with the original variance 5" by a factor of 1 + 5'/A'". 
We can straightforwardly calculate the mutual information as 

I{X:Y)^\\n[l + ^^, (24) 

which is determined by the signal-to-noise ratio S/N alone. In the hmit oi S/N ^ 0, 
where the noise dominates the signal, the mutual information vanishes. On the other 
hand, in the limit of S'/iV —)■ oo where the noise is negligible compared with the signal, 
the mutual information diverges, as the variable is continuous. 



4 Second Law of Thermodynamics with Feedback 
Control 

In this section, we discuss a universal upper bound of the work that can be extracted 
from information heat engines such as the Szilard engine. Starting from a general 
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argument for isothermal processes in Sec. 4.1, we discuss two models with which 
the universal bound is achieved in Sec. 4.2 and 4.3. Moreover, we will discuss an 
experimental result that demonstrates an information heat engine in Sec. 4.4. We 
will also discuss the Carnot efficiency with two heat baths in Sec. 4.5. 

4.1 General Bound 

In the conventional thermodynamics, we extract a work from a heat engine by chang- 
ing external parameters such as the volume of a gas or the frequency of an optical 
tweezer. In addition to such parameter changes, we perform measurements and feed- 
back control for the case of information heat engines. 

Suppose that we have a thermodynamic engine that is in contact with a single 
heat bath at temperature T. We perform a measurement on a thermodynamic engine, 
and obtain / of mutual information. After that, we extract a positive amount of 
work by changing external parameters. The crucial point here is that the protocol 
of changing the external parameters can depend on the measurement outcome via 
feedback control. 

In the case of the Szilard engine, we obtain In 2 nat of information, and extract 
k-^T In 2 of work. How much work can we extract in principle under the condition that 
we have I of mutual information about the system? The answer of this fundamental 
question is given by the following inequality [22|[26]: 

VText < -AF + kBTI, (25) 

where Wgxt is the average of the work that is extracted from the engine, and AF is the 
free-energy difference of the engine between the initial and final states. Inequality fl25p 
has been proved for both quantum and classical regimes • However, the mutual 

information in f l25|l needs to be replaced by a quantum extension of the mutual 
information [22lll8lll9] for quantum cases. We will prove inequality f l25|) for classical 
cases by invoking the detailed fiuctuation theorem in Sec. 5. 

Inequality f l2^ states that we can extract an excess work up to k^TI if we utilize / 
of mutual information obtained by the measurement. In the conventional thermody- 
namics, the upper bound of the extractable work is bounded only by the free-energy 
difference AF, which is determined by the initial and final values of external pa- 
rameters. For information heat engines, the mutual information is also needed to 
determine the upper bound of the extractable work. In this sense, inequality f l2^ is a 
generalization of the second law of thermodynamics for feedback-controlled processes, 
in which thermodynamic variables {W and AF) and the information content (/) are 
treated on an equal footing. 

The equality in fl25|l is achieved with the "best" protocol, which means that the 
process is quasi-static and the post-feedback state is independent of the measurement 
outcome, i.e., we utilize all the obtained information. This condition is achieved by 
the Szilard engine, as discussed above. Some models that achieves the equality in 
( 125|) have been proposed [23 | [M | l35 | Hi] . Two of them [35|H^ are discussed in Sees. 4.2 
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and 4.3. The Szilard engine, which gives W = k^T ln2, AF = 0, and / = In 2, 
achieves the upper bound of the extractable work, and its special role in information 
heat engines parallels that of the Carnot cycle in conventional thermodynamics. 

4.2 Generalized Szilard Engine 

We discuss a generalization of the Szilard engine with measurement errors and imper- 
fect feedback [44j, with which the equality in (125|) is achieved. The control protocol 
is described as follows (see Fig. 5): 



step 1: Thermal equilibrium. Step 2: Insertion of the barrier. 

P 1 -p 




Figure 5: Generalized Szilard engine. See the text for details. 



Step 1: Initial state. A single-particle gas is in thermal equilibrium, which is in 
contact with a single heat bath at temperature T. 

Step 2: Insertion of the harrier. We insert a barrier to the box and divide it into 
two with the volume ratio being p : 1 — p. 

Step 3: Measurement. We perform a measurement to find out which box the 
particle is in. The possible outcomes are "left" and "right," which we respectively 
denote as "0" and "1." The measurement can then be modeled by the binary channel 
with error rates Eq and Ei (see Sec. 3), where x (= 0, 1) specifies the location of the 
particle and y (= 0, 1) shows the measurement outcome. 

Step 4- Feedback. We quasi-statically move the barrier depending on the mea- 
surement outcome. If the outcome is "left" [y = 0), we move the barrier, so that the 
final ratio of the volumes of the two boxes is given by fo : 1 — fo (0 < fo < !)• If 
the outcome is "right" {y = 1), we move the the barrier, so that the final ratio of 
the volumes of the two boxes is given by 1 — f i : f i (0 < fi < 1). We note that the 
feedback protocol is specified by {vo,vi). 
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Step 5: Removal of the harrier. We remove the barrier, and the system returns to 
the initial thermal equilibrium by a free expansion. 

We now calculate the amount of the work that is extracted in step 4- By using 
the equation of states pV = k^T, we find that the extracted work is kBTlnlvo/p] for 
{x,y) = (0,0), fcBTln[(l - for (0,1), k^T ln[il - Vo) / {I - p)] for (1,0), and 

k^Tlnlvi/ll — p)] for (1, 1). Therefore the average work is given by 

^ = p(l-eo) In ^+p5o In ^ + (l-p)ei In i^ + (l-p)(l-ei) In (26) 
kB-l p p 1 — p 1 — p 

The mutual information obtained by the measurement is given by Eq. (|T8l) . The 
upper bound of inequality (125|) is not necessarily achieved with a general feedback 
protocol (uo, Vl). We then maximize We^t in terms of Vq and Vi. The optimal feedback 
protocol with the maximum work is determined by equations dWe:^t/dvQ = and 
dWext/dvi = 0, which lead to Vq = p{l — eo)/q = P[x = 0\y = 0] and Vi = (1 —p){l — 
£i)/(l — q) = P[x = l\y = 1]. Therefore, we obtain the maximum work as 

W^ext = k^TI, (27) 

which achieves the upper bound of the generalized second law (l25i) . 



4.3 Overdamped Langevin System 

We next discuss a feedback protocol for an overdamped Langevin system, which also 
achieves the upper bound of inequality ( 123]) as shown in Ref. We consider a 

Brownian particle with a harmonic potential, which obeys the following overdamped 
Langevin equation: 

V^ = -Xi{x-X2)+m, (28) 

where rj is a friction constant and ^(t) is a Gaussian white noise satisfying (C(t)^(t')) — 
2rik-BT5{t — t') with 5{-) being the delta function. The harmonic potential can be 
controlled through two external parameters (Ai, A2) such that 

y(x,Al,A2) = y(x-A2)^ (29) 

where Ai and A2 respectively describe the spring constant and the center of the po- 
tential. We consider the following feedback protocol (see Fig. 6). 

Step 1: Initial state. The particle is initially in thermal equilibrium with initial 
external parameters Ai(0) =: k and A2(0) = 0. 

Step 2: Measurement. We measure the position of the particle and obtain outcome 
y. The measurement error is assumed to be Gaussian that is given by Eq. fll9p . where 
S is the variance of x in the initial equilibrium state (i.e., 5* = k^T/k), and is 
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step 1: Thermal equilibrium. 




Figure 6: Feedback control on a Langevin system with a harmonic potentiaL See the 
text for details. 

the variance of the noise in the measurement [see Eq. f[T^ ]. Immediately after the 
measurement, the conditional probability is given by Eq. fl25|l . The obtained mutual 
information I is given by Eq. ^IM . 

Step 3: Feedback. Immediately after the measurement, we instantaneously change 
Ai from k to k' := (1 + S/N)k and A2 from to Hy := Sy/ {S + N). By this change, 
the conditional distribution becomes thermally equilibrated with new parameters 
(Ai, A2) = ik',ij,y). 

Step 4-' Work extraction. We quasi-statically expand the potential by changing Ai 
from k' to k thereby extracting the work. The system then get thermally equilibrated 
with parameters (Ai, A2) = {k,fj,y). 

We now calculate the work that can be extracted from this engine. Let P[x,i|?/] 
be the probability distribution of x at time t under the condition of y. The average 
of the work for step 3 is given by 

W^ixt = j [V{x, k, 0) - V{x, k', fly)] P[x, 0\y]P[y]dxdy = 0, (30) 

where P[x, 0|?/] is given by -P[x||/] in Eq. (123|) . and P[y] is given by Eq. ( 122|) . The 
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work for Step 4 is given by 



ext 



(4) ^ 




Ai(t),A2(t)) 





Comparing Eqs. fl30|l and 



(32) 



which achieves the upper bound of inequahty (125|) . 

4.4 Experimental Demonstration: Feedback-Controlled Ratchet 

We next discuss a recent experiment that reahzed an information heat engine by using 
a real-time feedback control on a colloidal particle in water at room temperature |30j . 

In the experiment, a colloidal particle with diameter 300 nm is attached to the 
cover glass, and another particle is attached to the first one [Fig. 7 (a)]. The second 
particle then moves around the first as a rotating Brownian particle which we observe 
and control. An AC electric field is applied with four electrodes, and the particle un- 
dergoes an effective potential as illustrated in Fig. 7 (b). We note that the potential 
can take two configurations depending on the phases of the electric field. Each con- 
figuration consists of a spatially-periodic potential and a constant slope. The slope is 
created by a constant torque around the circle along which the particle rotates. This 
potential is like spiral stairs. The depth of the periodic potential is about SksT, and 
the gradient of the slope per angle 27i is about k^T. 

If the periodic potential without the slope was asymmetric and the two potential 
configurations were periodically switched, the particle would be transported in one 
direction as a flashing ratchet [50145^ . In the present setup, however, the periodic 
potential without the slope is symmetric and is not switched periodically but switched 
in a manner that depends on the measured position of the particle via feedback 
control [TTlEniESlES]. Such a feedback-controlled ratchet has been experimentally 
realized [20]. In the present experiment, the work and the free energy were measured 
precisely for quantitatively comparing the experimental results with the theoretical 
bound (1251) . It has been pointed out that the feedback-controlled ratchet, as 

well as the flashing ratchet, can be a model of biological molecular motors [5B] . 

The feedback protocol in the experiment ^0] was done as follows [see Fig. 7 (c)]. 
The position of the particle was probed every 40 ms by a microscope, a camera, and 
an image analyzer. Only if the particle was found in the switching region described 
by "S" in Fig. 7 (c), the potential configuration was switched after a short delay time 
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e. By this switching, when the particle reached the hilltop, the potential is inverted, 
so that the peak of the potential changed into the bottom of the valley, and therefore 
the particle is transported to the right direction. Without the switching, the particle 
would be more likely to go back to the left valley. This position-dependent switching 
via feedback control induces the reduction of the entropy in a manner analogous to 
the feedback control in the Szilard engine. By performing this protocol many times, 
the particle is expected to be transported to the right direction by climbing up the 
potential slope. 




Figure 7: Experimental setup of the feedback-controlled ratchet (reproduced from 
Ref. [30] with permission), (a) A rotating Brownian particle, (b) Two possible 
configurations of the potential that can be switched into each other, (c) Feedback 
protocol. Only if the particle is found in region "S," the potential is switched. 

Figure 8 (a) shows typical trajectories of the particle. If the feedback delay e 
is sufficiently shorter than the relaxation time of the particle in each well (~ 10 
ms), the particle climbs up the potential. If the feedback delay is longer than the 
relaxation time, the feedback does not work and the particle moves down the potential 
in agreement with the conventional second law of thermodynamics. Figure 8 (c) 
shows the averaged velocity of the particle versus e, which implies that the shorter 
the feedback delay is, the faster the average velocity is. 

Figure 8 (c) shows the energy balance of this engine. The shaded region is pro- 
hibited by the conventional second law of thermodynamics (AF — W) < 0, where 
AF is the free-energy difference corresponding to the height of the potential, W is 
the work performed on the particle during the switching, and (■ ■ ■ ) represents the 
ensemble average over all trajectories. By using information via feedback, however, 
the shaded region is indeed achieved if e is sufficiently small. The resource of the 
excess free-energy gain is thermal fluctuations of the heat bath, which are rectifled by 
feedback control. This is an experimental realization of an information heat engine. 

For the case oi e = 1.1 ms, (AF — W) = 0.062A;b^- On the other hand, the 
obtained information is given by / = 0.22, which can be calculated from the histogram 
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Figure 8: Experimental results on the feedback-controlled ratchet (reproduced from 
Ref. [SU] with permission), (a) Typical trajectories of the particle with feedback 
delays e = 1.1 ms, 8.8 ms, and 35.2 ms. (b) The averaged velocity of the particle 
versus the feedback delay, (c) Energy balance of feedback control. The shaded region 
is prohibited by the conventional second law of thermodynamics, and can only be 
achieved by feedback control. 

of the measurement outcomes by assuming that the measurement is error-free. By 
comparing this experimental data with the theoretical bound f l25|l . the efficiency of 
this information heat engine is determined to be {AF — W)/k^TI = 0.062/0.22 ~ 
0.28. The reason why the efficiency is less than unity is twofold: (1) the switching is 
not quasi-static but instantaneous, and (2) the obtained information is not utilized if 
the particle is found outside of the switching region. 

We note that various experiments that are analogous to Maxwell's demon have 
been performed with, for example, a granular gas [STflSS]. supramolecules [59], and ul- 
tracold atoms [60j; however, these examples do not explicitly involve the measurement 
and feedback, as the controlled system and the demon constitute an autonomous sys- 
tem in those experiments. Such autonomous versions of Maxwell's demon have also 
been theoretically studied [6TH66] . In contrast, in the experiment of Ref. [30], the 
demon (the camera and the computer) is separated from the controlled system (the 
colloidal particle) as in the case for the Szilard engine. We also note that an informa- 
tion heat engine similar to that in Ref. [SU] has been proposed for an electron pump 
system in Ref. [39] . 

4.5 The Carnot Efficiency with Two Heat Baths 

We next consider the case in which there are two heat baths and the process is a 
thermodynamic cycle. Without feedback control, the heat efficiency is bounded by 
the Carnot bound. If we perform measurements and feedback control on this system. 
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the extractable work is bounded from above by ^2] 

W^ext < - Qh + k^ni, (33) 

where Th and Tl are the temperatures of the hot and cold heat baths, respectively, 
and Qh is the heat that is absorbed by the engine from the hot heat bath. The proof 
of inequality f l33|) will be given in Sec. 5. The last term on the right-hand side (rhs) 
of (ES]) describes the effect of feedback. We note that the coefficient of the last term 
is given by the temperature of the cold bath. 

The equality in (133|) is achieved in the following example. We consider a single- 
particle gas in a box, and quasi-statically control it as in the case of the usual Carnot 
cycle. We then perform the Szilard-engine-type operation consisting of measurement 
and feedback on the system while it is in contact with the cold bath. In this case, we 
have ksTj^ In 2 of excess work, and Qu remains unchanged. Therefore, the equality in 
f l33|l is achieved with / = ln2. 

We can also achieve the equality in fl33|) if we perform the Szilard-engine-type 
operation while the engine is in contact with the hot heat bath. In this case, we can 
extract k^TuI of excess work, and Qu is increased by k^TnI. Therefore, we again 
obtain the equality in fl5^ with J = ln2. 

5 Nonequilibrium Equalities with Feedback Con- 
trol 

Since the late 1990's, a number of universal equalities have been found for nonequi- 
librium processes [67H75] . and they have been shown to reproduce the second law of 
thermodynamics and the fluctuation-dissipation theorem. The fluctuation theorem 
and the Jarzynski equality are two prime examples of the nonequilibrium equalities. 
In this section, we generalize the nonequilibrium equalities to situations in which a 
thermodynamic system is subject to measurements and feedback control in line with 
Refs. [26l[29llll] . As corollaries, we derive inequalities ( 125|) and (133|) . 

5.1 Preliminaries 

First of all, we review the nonequilibrium equalities without feedback control. We 
consider a stochastic thermodynamic system in contact with heat bath(s) with inverse 
temperatures (3m {m = 1, 2, ■ ■ ■ ). Let x be the phase-space point of the system. The 
system is controlled through external parameters A, which describe, for instance, the 
volume of a gas or the frequency of an optical tweezer. Even when the initial state of 
the system is in thermal equilibrium, the system can be driven far from equilibrium 
by changing the external parameters. We consider such a stochastic dynamics of the 
system from time to r. The state of the system stochastically evolves as x{t) under 
a deterministic protocol of the external parameters denoted collectively as A(t). 
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Let Xt := {x{t)}o<t<T be the trajectory of the phase-space point and A^- : = 
{X{t)}o<t<T be that of the external parameters. The heat that is absorbed by the 
system from heat bath "m" is a trajectory-dependent quantity, which we write as 
Qmi-^rlA,-]. The work that is performed on the system is also trajectory-dependent 
and is denoted as iy[XT-|AT-]. The first law of thermodynamics is then given by 

H[x{t)\X{t)] - H[x{0)\X{0)] = W[X^\Ar] + J2Qm[Xr\Ar], (34) 

m 

where iJ[x|A] is the Hamiltonian with external parameters A, and the work can be 
written as 

W[Xr\Ar] = ^l^it)\m]^dt. (35) 

Let P[Xt-|At-] be the probability density of trajectory X^ with control protocol A,-. 
It can be decomposed as P[X^|At-] = P[X^|x(0), A^]Pf [x(0)], where P[X^|x(0), A^] is 
the probability density under the condition that the initial state is x(0), and Pf[x(0)] 
is the initial distribution of the forward process. 

We next introduce backward processes of the system. Let x* be the time-reversal 
of X. For example, if a; = (r, p) with r being the position and p being the momentum, 
then X* = (r, —p). Similarly, we denote the time- reversal of A as A*. For example, if 
A is the magnetic field, then A* = —A. Let X^. be the time-reversed trajectory of X,- 
defined as X| := {x*{t — t)}o<t<T- We also write the time-reversal of control protocol 
A^ as Al := {A*(r — t)}o<t<T, and write as P[X^|A!j^] the probability density of the 
time-reversed trajectory with the time-reversed control protocol. We can decompose 
P[Xt|At] as P[Xt|At] = P[Xt|xt(0), A,]Pb[xt(0)], where P[Xt|xt(0), At] is the prob- 
ability density under the condition that the initial state of the backward process is 
x^{0), and Pb[a;"'"(0)] the initial distribution of the backward process. We stress that 
the initial distribution of the backward process Pb[a;t(0)] can be set independently of 
the final distribution of the forward process. In experiments, we can initialize the sys- 
tem before we start a backward process so that its initial distribution can be chosen 
independently of the forward process. 

The detailed fluctuation theorem (the transient fluctuation theorem) is given 
by [68HZI] 

P[X.|x(0),A.] ■ ^^^^ 

Defining the entropy production as 

a[X.|A.] :=lnPf[a;(0)]-lnPb[a;t(0)]-5^Q„[X,|A.], (37) 

m 

we obtain 

P[Xt|At] _ _^[^^|^^ 



P[X.|A,] 



e-lA.|A.j^ (38) 
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By taking the ensemble average of Eq. fl38p . we have 

dX.PlX.IA.Je-'^l^-l^^] = I dX,P[XMr]j0j^^ = I dXlP[Xl\Al] = 1, 

(39) 

where we used dX^- = dX].. Therefore, we obtain the integral fluctuation theorem 

(e--) = 1. (40) 

By using the concavity of the exponential function, we find from Eq. fHOj) that 

{a) > 0, (41) 

which is an expression of the second law of thermodynamics. In the following, we 
discuss the physical meanings of entropy production a for typical situations. The 
equality in (HTj) is achieved if P[X^|A]^] = P[Xt-|At-] holds for any X^-, which implies 
the reversibility of the process. 

We first consider isothermal processes. In this case, we choose the initial distri- 
butions of the forward and backward processes as 

Pf[x(0)] = exp (/3(P[A(0)] - H[xmm])) , 
Pb[xt(0)] = exp(/3(P[At(0)]-iJ[xt(0)|At(0)])), ^ ' 

where P[A] := —k^T In J dxe'^^^^^"^^ is the free energy of the system. We assume that 
the Hamiltonian has the time-reversal symmetry 

H[x\\] = H[x*\\*], (43) 

and therefore the canonical distribution satisfies 

Pb[xt(0)] = exp (/3(P[A(r)] - if[x(r)|A(r)])) . (44) 

The entropy production then reduces to 

a[Xr] = P{W[X,]-AF), (45) 

where AP := P[A(r)] — P[A(0)]. Thus, the integral fluctuation theorem fj^Oj) leads to 
the Jarzynski equality [67] 

(e-/^^) = e-'^^^, (46) 
and inequality fHTl) gives the second law of thermodynamics 

{W) > AP, (47) 

where Wext := —{W) is the work that is extracted from the system. 

We next consider a case with multi-heat baths, and assume that the initial dis- 
tributions of the forward and the backward processes are given by the canonical 
distributions as in fH21) with a reference inverse temperature /3. In practice, /3 can be 
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taken as one of P^s, which can be reahzed if the system is initially attached only to 
that particular heat bath. We then have 

a[Xr] = P{AE[Xr] -AF)-J2 Mm[Xr], (48) 

m 

where Ai?[X^] := H[x{t)\X{t)] — if[a;(0)|A(0)] is the difference of the internal energy 
of the system. Inequahty fHT]) leads to 

l^^iQm) < Pi{AE) - AF). (49) 

m 

In particular, if the process is a cycle such that AF = and (AE) = hold, inequal- 
ity din]) reduces to 

J2/3m{Qm)<0, (50) 

m 

which is the Clausius inequality. If there are two heat baths with temperatures Th 
and Tl, (150|) gives the Carnot bound 

- W< (i-^)(Qh), (51) 

where (Qb) is the average of the heat that is absorbed by the engine from the hot 
heat bath. 



5.2 Measurement and Feedback 

We now formulate measurements and feedback on the thermodynamic system [26] 
We perform measurements at time {k = 1,2, ■ ■ ■ , M) with < ti < ^2 < 
■ ■ ■ < tM < T. Let y{tk) be the measurement outcome at time t^. For simplicity, we 
assume that the measurement is instantaneous; the measurement error of y{tk) can be 
characterized only by the conditional probability Pc[y{tk)\x{tk)], which implies that 
only y(tk) has the information about x{tk). (Note, however, that this assumption 
can be relaxed |11].) We write the sequence of the measurement outcomes as 1^ •= 
{yiti),yit2), ■ ■ • , yitM)), and write 

PJKIX.] := \[PMh)\x{tu)]. (52) 

k 

We then introduce the following quantity: 

/c[X.:K]:=ln^^^^, (53) 

which can be interpreted as a stochastic version of the mutual information. The 
ensemble average of Eq. flS^ gives the mutual information obtained by the measure- 
ments as 

Pc[Yr\Xr\ 



J dXrdYrP[Xr,Yr] In ^^^j^ 



(Jc) = / dXrdYrP[Xr, K] lu 'Zl ' (^4) 
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We note that (/c) describes the correlation between and that is induced only by 
measurements, and not by feedback control. The suffix "c" represents this property 
of Ic- We then identify (/c) with / in the foregoing arguments. See Ref. |44j for 
details. We note that {Ic) has been discussed and referred to as the transfer entropy 
in Ref. [76]. The following equality also holds by definition: 

(e^^^) = 1. (55) 

We next consider feedback control by using the obtained outcomes. The control 
protocol after time tk can depend on the outcome y{tk) in the presence of feedback 
control. We write this dependence as KriXr)- We can show that the joint probability 
of (Xt-, Yt-) is given by 

P[Xr,Yr] = Pc[Yr\Xr]P[Xr\K{Yr)l (56) 

which satisfies the normalization condition j dXT-dYrP[Xr,Yr] = 1. Equality (156|1 is 
proved in Appendix A (see also Ref. [B]). The probability of obtaining outcome Yr in 
the forward process is given by the marginal distribution as P[Yr] = J dXrP[Xr, Yr], 
and the conditional probability is given by P[X^|yr] = -P[X^, YrlZ-Pf^r]- In the pres- 
ence of measurement and feedback, the ensemble average is taken over all trajectories 
and all outcomes; for an arbitrary stochastic quantity A[X^, Y^], its ensemble average 
is given by 

{A) = J dXrdYrP[Xr,Yr]A[Xr,Yr]. (57) 
The detailed fluctuation theorem for a given Y^ can be written as 

p[xt|A.(n)t] _ _,[^^|^^(^^)] 



P[Xr\Ar{Yr)] 



We note that Eq. f lSS]) is valid in the presence of feedback control, because the detailed 
fluctuation theorem is satisfied once a control protocol is fixed. Equality (l58i) provides 
the basis for the derivations of the formulas in the following section. 

5.3 Nonequilibrium Equalities with Mutual Information 

In this subsection, we generalize the nonequilibrium equalities by incorporating the 
mutual information. First of all, from Eq. fl53|) . we have 



-e--: (59) 



By multiplying the both-hand sides of this equality by those of Eq. ( 155]) . we obtain 

P[Xl\Ar{Yry]P[Yr] ^ ^-^[Xr\AAYr)]-Ic[Xr:Yr] _ (^gQ) 
P[Xr, Yr] 
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To measure P[X^|AT-(yT-)''']P[yT-], we follow the backward process corresponding to 
each forward outcome and count the occurrences of the time-reversed trajectories. 
By taking the ensemble average of the both-hand sides of Eq. fl60l) with formula fl57|l . 
we obtain a generalized integral fluctuation theorem with feedback control: 

(e-"-^0 = 1. (61) 

Using the concavity of the exponential function, Eq. fl6Tl) leads to 

{a) > -(/,). (62) 

Inequality 0621) is a generalized second law of thermodynamics, which states that the 
entropy production can be decreased by feedback control, and that the lower bound 
of the entropy production is given by the mutual information (Jc). As shown below, 
inequalities fl25|) and fl33|) in Sec. 4 are special cases of inequality f l62l) . The equality 
in is achieved if P[Xt|A^(y,)t]P[F^] = P[X^, Y^] holds for any and Y^, which 
implies the reversibility with feedback control as discussed in Ref. [34J. 

The generalized integral fluctuation theorem of the form (16T|) was first shown 
in Ref. [26] for a single measurement, and Eq. (160|) was obtained in Ref. [29llllj for 
multiple measurements. These results has also been generalized to the optimal control 
process with continuous measurement and the Kalman filter in Ref. PE]. 

A generalized fluctuation theorem was also obtained in Ref. [T^, which is similar 
to Eq. fl6Tl) . In Ref. [19], feedback control is performed based on information about the 
continuously-monitored velocity of a Langevin system. The result of Ref. [19] includes 
an quantity that describes the decrease in the entropy by continuous feedback control, 
instead of the mutual information obtained by the continuous measurement. 

We consider isothermal processes with a single heat bath, in which the entropy 
production is given by Eq. (H5i) . Equality (16T|) then reduces to a generalized Jarzynski 
equality 

^^mF-w)-i.^ = 1, (63) 

and inequality fl62l) reduces to 

(AP-W^)>A;BT(/e), (64) 
which implies inequality f l25|) with identifications Wext = —{W), AF = (AP), and 

We next consider the cases in which there are two heat baths and the process is a 
cycle, in which the entropy production is given by the ensemble average of Eq. fHHl) 
with (AP) = (AP) = 0. The generalized second law (162|) then leads to 

MQn) + /3l{Ql) < (/c), (65) 

which can be rewritten as 

- {W) < (^1 - (Qh) + A;bTl(/c). (66) 
By identifying Qh = {Qh), inequality ( 1^ implies inequality 
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5.4 Nonequilibrium Equalities with Efficacy Parameter 

In this subsection, we discuss another generahzation of nonequihbrium equahties. We 
define the time-reversal of outcomes Yr as Yj := {y{T—tM)*, ■ ■ ■ , ^2)*, 2/(^—^1)*), 
where y* is the time-reversal of y, and introduce the probability that we obtain 
outcome Yj with control protocol At-IY^Y , which is given by 

P[y;|A.(n)t] = J dXlP^[Y^\Xl]P[Xl\Ar{Yry]. (67) 

We stress that no feedback control is performed in the backward processes. We then 
assume that the measurement error has the time-reversal symmetry 

P,[Y^\Xl] = P,[Yr\Xr]. (68) 

This assumption is satisfied if Pc[y{tk)\x{tk)] = Pc[y{T — tk)*\x{T — tk)*] holds for 
k = 1,2, ■ ■ ■ , M. By using Eq. ( I67|) and assumption ( 168|) . we can show that 

— m] — 

where (■ ■ ■ denotes the conditional average with condition Yr such that 

(e-'^)y, := J dX,e-'^[^-l^-(^-)lp[X,|K]. (70) 

Equality (169|) has been shown for Hamiltonian systems [72\ and stochastic systems [261 
m]. By noting that 

{e-n = J dYrP[Yr]{e-^)y^, (71) 
we obtain yet another generalization of the integral fluctuation theorem 



(e-'^) = 7, (72) 

where 

7= /" dKP[r;|A.(K)^] (73) 



is the sum of the probabilities that we obtain the time-reversed outcomes with a 
time-reversed protocol. For the cases of isothermal processes, Eq. fl72|) reduces to 

(^^mF-w)^ = 7. (74) 

We note that 7 characterizes the efficacy of feedback control. The more efficient 
the feedback protocol is, the larger the amount of 7 is. Without feedback control, 
P[yj!|A|^] reduces to a single unconditional probability distribution, and we therefore 
obtain 



7 



J dYrP[Yj\Ai] = 1, (75) 
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step 4: Removal of the barrier. 




Figure 9: Backward processes of the Szilard engine. See the text for details. 

which reproduces the integral fluctuation theorem (HO!) without feedback. We note 
that the maximum value of 7 is the number of the possible outcomes of Y^-. 

We illustrate the efficacy parameter 7 for the case of the Szilard engine that is 
described in Sec. 2. The backward control protocol of the Szilard engine is as follows 
(see also Fig. 9) [26] . 

Step 1: Initial state. The single-particle gas is initially in thermal equilibrium. 

Step 2: Compression of the box. In accordance with the measurement outcome in 
the forward process, which is "0" (= "left") or "1" (= "right"), we quasi-statically 
compress the box by moving the wall in the box to the center. By this compression, 
the volume of the box becomes half. 

Step 3: Measurement. We measure the position of the particle to find which box 
the particle is in. The outcome of this backward measurement is "0" (= "left") or 
"1" (= "right") with unit probability corresponding to forward outcome "0" or "1," 
respectively. 

Step 4: We remove the barrier at the center of the box, and the engine returns to 
the initial state by a free expansion. 

In these backward processes, the measurement outcomes in step 2 satisfy P[0|A^(0)^] = 
1 and P[l|A^(l)t] = 1, and therefore we obtain 7 = P[0|A^(0)t] + P[l|A^(l)t] = 2, 
which gives the maximum value of 7 for situations in which the number of possi- 
ble outcomes is two. On the other hand, since W = —k^T ln2 and AP = in 
the absence of fiuctuations, the generalized Jarzynski equality fl7i|) is satisfied as 

^^mF-w)^ = 2 = 7. 

The generalized Jarzynski equality ([71]) has been experimentally verified in the 
experiment described in Sec. 4.4 by measuring AP — W and 7 separately in the 
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forward and backward experiments, respectively [50! • Equalities and (1711) have 
been obtained in Hamiltonian systems [57]. Equality (171|) has also been generalized 
to quantum systems [32|H2]. 

While Eq. fl6T]) only includes the obtained mutual information and does not de- 
scribe how we utilize the information via feedback, Eq. fl72|) includes the term of 
feedback efficacy that depends on the feedback protocol. To quantitatively discuss 
the relationship between mutual information I^. and efficacy parameter 7, we define 
C{A] := — ln(e~^). By noting Eq. flS^ . we obtain 

C[a] + C[Q - C[a + Jc] = - In 7. (76) 

If the joint distribution of a and Jc is Gaussian, Eq. fl75]) reduces to 

(a/,)-(a)(/,) = -ln7. (77) 

Equalities (176|) and ([77]) imply that, the more efficiently we use the obtained informa- 
tion to decrease the entropy production by feedback control, the larger 7 is. In fact, 
if 7 is large, the left-hand sides of Eqs. (176|) and (1771) are both small, which means 
that the obtained information 1^ has a large negative correlation with a. Without 
feedback control, 7 = 1 holds and therefore Ic is not correlated with a. In this sense, 
7 characterizes the efficacy of feedback control. 

6 Thermodynamic Energy Cost for Measurement 
and Information Erasure 

So far, we have discussed the energy balance of information heat engines controlled 
by the demon. In this section, we discuss the energy cost that is needed for the demon 
itself, which has been a subject of active discussion [2l-[i| [77H92 ] . 

Suppose that the demon has a memory that can store the outcome obtained by 
a measurement. If the outcome is binary, the memory can be modeled by a system 
with a binary potential (see Fig. 10). Before the measurement, the memory is in the 
initial standard state 0. The memory then interacts with a measured system such 
as the Szilard engine, and stores the measurement outcome. Figure 10 illustrates a 
case with a binary outcome. Let pk be the probability of obtaining outcome k. After 
the measurement, the memory is detached from the measured system and returns 
to the initial standard state, which is the erasure of the obtained information. The 
central question in this section is how much work is needed for the demon during the 
measurement and the information erasure. 

Let be the free energy of the memory under the condition that the outcome 
is "fc." During the measurement process, the free energy of the memory is changed 
on average by AF^ := YlkPk^k^ ~ -^o^; where is the free energy of the initial 
standard state. If F^^s are the same for all fc's including k = (i.e., the memory's 
potential is symmetric), AF^ = holds for every {pk}- It has been shown [92] that 
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Figure 10: A schematic of the measurement and erasure of information for the case of 
an asymmetric binary memory. While the memory is in the standard state with unit 
probabihty before the measurement, the memory stores the measurement outcome 
in accordance with the state of the measured system. The measurement and era- 
sure processes are time-reversal with each other except for the fact that the memory 
establishes a correlation with the measured system during the measurement process. 



the averaged work W^g^^ that is performed on the memory during the measurement 
is bounded as 

W^,,, > AF^ - ksTH + k^TI, (78) 

where H := —Y2i^Pk^^Pk is the Shannon information of the outcomes and / is the 
mutual information obtained by the measurement. For the special case with AF^ = 
and H = I, the rhs of inequality flTSjl reduces to zero. 

On the other hand, during the information erasure, the change of the free energy 
of the memory is given by —AF^. The averaged work that is needed for the 

erasure process is bounded as 

W^^ > -AF^ + kjiTH. (79) 

If AF^ vanishes, inequality ( !79|) reduces to 

W^,, > k^TH, (80) 

which is known as the Landauer principle [,78j. The additional term — AF^ on the 
rhs of f l79|l arises from the asymmetry of the memory. By summing up inequalities 
f lTHj) and f l79|l . we obtain the fundamental inequality 

W^e.s + W^^^ > k^TI, (81) 

which implies that the work needed for the demon is only bounded by the mutual 
information if we take into account both the measurement and erasure processes. 

We stress that, while inequality ( [79|) is a generalized Landauer principle for the 
information erasure, inequality (I5T]) is completely different from the Landauer princi- 
ple. In fact, while the lower bound of the Landauer principle is given by the Shannon 
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information that characterizes the randomness of the measurement outcomes, the 
lower bound of (1811) is given by the mutual information that characterizes the corre- 
lation between the measured system and the measurement outcome. Moreover, both 
terms on the rhs of fl79|l is exactly canceled by the first and second terms on the rhs of 
flTSjl . The reason for the cancellation lies in the fact that the dynamics of the memory 
during the erasure process is the time-reversal of the measurement process, except 
for the fact that the memory interacts with the measured system and establishes a 
correlation (or equivalently, gains information) only in the measurement process (see 
also Fig. 10). The additional cost for the establishment of the correlation is given by 
the last term on the rhs of ( I78|) . which also appears in the rhs of (IHTl) . Therefore, the 
mutual information term in inequality (IHT]) is induced by the measurement process. 

Historically, there has been a lot of discussions [2H1] as to what compensates 
for the additional work of k^T ln2 which can be extracted from the Szilard engine. 
Szilard considered that an entropic cost must be needed for the measurement pro- 
cess [l5]. L. Brillouin [77] argued that we need the work greater than k^T ln2 for the 
measurement process, based on a specific model of measurement. Later, by explicitly 
constructing a model of the memory does not require any work for the measurement, 
C. H. Bennett argued that, based on the Landauer principle flHU]) . we always need the 
work of at least k^T ln2 for the erasure process [79l|85]. The key observation here is 
that the erasure process is logically irreversible while the measurement process can 
be logically reversible. In fact, if we assume that the Shannon information of the 
measurement outcome equals the thermodynamic entropy of the memory, the logi- 
cally irreversible erasure should be accompanied by a reduction in the thermodynamic 
entropy of the memory, which implies that k-^T In 2 of heat should be transfered to 
the heat bath and, therefore, the same amount of the work is needed. 

However, the argument by Landauer and Bennett is valid only for symmetric 
memories with AF^ = 0. As discussed in Refs. [SSH^ . the Shannon information 
does not equal the thermodynamic entropy of the memory in general. If the memory 
is asymmetric as illustrated in Fig. 10, the lower bound of the energy cost needed for 
the information erasure is not given by (IHOj) . and the Landauer principle needs to be 
generalized to inequality (179|) for asymmetric memories. We note that the Landauer 
principle can also be violated for symmetric memories in the quantum regime due 
to the initial correlation between the memory and the heat bath [551157 ]. A more 
detailed historical review about the Landauer principle is given in Ref. [Ij. 

As a consequence, the lower bound of the individual energy cost for measurement 
or erasure processes can be made arbitrarily small for asymmetric memories, while 
their sum (ISTl) is bounded from below by k^TI that originates from the measurement 
process. The total work given in the left-hand-side of fl5T]) then compensates for 
k^TI of additional work in (1231) that is extracted from an information heat engine by 
the demon. This compensation confirms the consistency between the demon and the 
second law of thermodynamics; we cannot extract any positive amount of work by a 
cycle from the total system consisting of the engine and the memory of the demon. 

Nevertheless, feedback control is still useful for manipulating small thermody- 
namic systems. In fact, as discussed in Sec. 2, feedback control enables us to increase 
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the engine's free energy without injecting energy to the engine directly. In other 
words, the work fl8ip needed for the demon is not necessarily transfered to the en- 
gine, which can be energetically separated from the demon. Therefore, by using 
information heat engines, we can control thermodynamic systems beyond the energy 
balance that is imposed by the conventional thermodynamics. 



7 Conclusions 

In this chapter, we have discussed a generalized thermodynamics that can be applied 
to feedback-controlled systems which we call information heat engines. The Szilard 
engine described in Sec. 2 is the simplest model of information heat engines. Based on 
the information theory reviewed in Sec. 3, we have formulated a generalized second 
law involving the term of the mutual information in Sec. 4. The generalized second 
law gives an upper bound of the work that can be extracted from a heat bath with the 
assistance of feedback control. We also discussed some typical examples of information 
heat engines including a recent experimental result [3D]. In Sec. 5, we discussed 
nonequilibrium equalities with feedback control, and derived the generalized second 
law discussed in Sec. 4. We also discussed the energy cost that is needed for the 
measurement and the information erasure in Sec. 6. 

Inequalities fl25|) . f lTHj) . fl79|) . and flHT]) are the generalizations of the second law of 
thermodynamics, giving the fundamental bounds of the work needed for information 
processing. In fact, if we set the information contents to be zero (i.e., I = H = 0) 
in these inequalities, all of them reduce to the conventional second law of thermo- 
dynamics. In this sense, these inequalities constitute the second law of "information 
thermodynamics," which is a generalized thermodynamics for information processing. 

While the studies of information and thermodynamics have a long history, recent 
developments of nonequilibrium physics and nanotechnologies have shed new light 
on classic problems from the modern point of view. Thermodynamics of information 
processing will open a fruitful research arena that enables us to quantitatively analyze 
the energy costs of the feedback control and information processing in small thermo- 
dynamic systems. Possible applications of this new research field include designing 
designing and controlling nanomachines [23] and nano devices. 



A Proof of Eq. ([56]) 



In this appendix, we prove Eq. fl5B]) . We introduce notations Xt^._^<t<tfc := {^i't)}tk_i<t<tk- 
Xt^ := {x{t)}o<t<tk, := {\{t)}o<t<tk, and Yt^ := {y{ti),--- ,y{h)). The joint 
probability of (X^, Yr) is given by 

M<t<-r|^tM> 

• npb(4)|x(4)]p[x,,_,<,<,jx,,_,, A,,(y;,_j] • p,[xr' ^^^^ 

fc=l 
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where we set to •= 0. We note that A^^. depends only on Yti^_^ due to the causahty. 
We also note that 

M 
k=l 

(83) 

By combining Eqs. fl52l) . fl82|) . and fl83|) . we obtain Eq. fl56|) . We can confirm that 
the joint probability satisfies the normalization condition J dX^-dYT-PlXT, Yj] = 1 by 
integrating Eq. ([H2D in the order of Xtj,j<t<T yitu) Xt^i.i<t<tM y(^Af-i) 
• • ■ — )• — )■ -'^o<t<ti ~^ 3;(0) due to the causality. 
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